Spelling Error Trends and Patterns in Sindhi

نویسندگان

  • Zeeshan Bhatti
  • Imdad Ali Ismaili
  • Asad Ali Shaikh
  • Waseem Javaid
چکیده

Statistical error Correction technique is the most accurate and widely used approach today, but for a language like Sindhi which is a low resourced language the trained corpora’s are not available, so the statistical techniques are not possible at all. Instead a useful alternative would be to exploit various spelling error trends in Sindhi by using a Rule based approach. For designing such technique an essential prerequisite would be to study the various error patterns in a language. This paper presents various studies of spelling error trends and their types in Sindhi Language. The research shows that the error trends common to all languages are also encountered in Sindhi but their do exist some error patters that are catered specifically to a Sindhi language.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of Sindhi Spelling Error Patterns for Spelling Error Detection and Correction

Statistical analysis of spelling error trends in a language plays important role in automatic spelling error detection and correction. Comprehensive statistical analysis of spelling error trends for Sindhi is still subject of research. This research study identifies and analyses the spelling error trends in Sindhi. The statistical analysis of error trends is based on a real time corpus collecte...

متن کامل

Development of Unicode based Sindhi Typing System

This paper presents a first attempt in designing and development of Unicode based Sindhi Typing System for the Sindhi speaking community. The Sindhi Typing project is developed in order to improve the typing speed of Sindhi computing professionals as no such system currently exists. It is Platform independent application requiring no third party plugin or any regional languages support. No Sind...

متن کامل

Spelling Error Trends in Urdu

Today the most accurate error correction techniques are statistical. But for low resourced languages like Urdu, where training error corpora are not available, statistical techniques are out of the question. Rule based techniques that exploit spelling error trends provide a useful alternative. The study of error patterns in a language is an essential prerequisite for designing such techniques. ...

متن کامل

Spelling Error Patterns in Spanish for Word Processing Applications

This paper reports findings from the elaboration of a typology of spelling errors for Spanish. It also discusses previous generalizations about spelling error patterns found in other studies and offers new insights on them. The typology is based on the analysis of around 76K misspellings found in real-life texts produced by humans. The main goal of the elaboration of the typology was to help in...

متن کامل

Design and implementation of Persian spelling detection and correction system based on Semantic

Persian Language has a special feature (grapheme, homophone, and multi-shape clinging characters) in electronic devices. Furthermore, design and implementation of NLP tools for Persian are more challenging than other languages (e.g. English or German). Spelling tools are used widely for editing user texts like emails and text in editors.  Also developing Persian tools will provide Persian progr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1403.4759  شماره 

صفحات  -

تاریخ انتشار 2012